Does the h-index have predictive power? 
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Bibliometric measures of individual scientific achievement are of particular interest if they can be 
used to predict future achievement. Here we report results of an empirical study of the predictive 
power of the h-index compared to other indicators. Our findings indicate that the h-index is better 
than other indicators considered (total citation count, citations per paper, and total paper count) 
in predicting future scientific achievement. We discuss reasons for the superiority of the h-index. 

PACS numbers: 



I. INTRODUCTION 

The h-index of a researcher is the number of papers 
coauthored by the researcher with at least h citations 
each[l|. We have recently proposed it as a representative 
measure of individual scientific achievement. Other com- 
monly used bibliometric measures of individual scientific 
achievement are total number of papers published (N p ) 
and total number of citations garnered (N c ). Recently, 
Lehmann et al have are ued[ll|] that the mean number 
of citations per paper (n c = N c /N p ) is a superior indi- 
cator. Here we wish to address the question: which of 
these four measures is best able to predict future scien- 
tific achievement? 

For the purposes of this paper we do not wish to dwell 
on the controversial question of what is the optimal def- 
inition of scientific achievement 1 . We are not interested 
in measuring the past achievement of an individual e.g. 
for the purpose of awarding a prize or for election to 
a prestigious academy, but rather in predicting future 
achievement. So we could simply bypass this question by 
defining scientific achievement by the bibliometric mea- 
sure under consideration, and ask: which measure is bet- 
ter able to predict its future values? For example, how 
likely is a researcher that today has a large number of 
citations to gain a large number of citations in future 
years? To the extent that a bibliometric measure reflects 
particular traits of the researcher rather than random 
events, it should have higher predictive power than an- 
other measure that is more dependent of random events. 
For example, we argued in ref. 1 that the total number of 
citations N c "may be inflated by a small number of 'big 
hits', which may not be representative of the individual 
if he/she is coauthor with many others on those papers" . 
For that individual, the present N c value is not likely to 
be a good predictor of his/her future N c values. 

Alternatively, among the indicators listed in the first 
paragraph, it may be argued that the total number of ci- 



tations N c is the one that best reflects scientific achieve- 
ment, since it gives a measure of the integrated impact of 
a scientist's work on the work of others. Then we would 
like to know: which indicator is best able to predict N c at 
a future time? It is certainly not obvious that the answer 
is N c itself. 

There are two slightly different questions of interest: 
(a) Given the value of an indicator at time t\, I(t\), how 
well does it predict the value of itself or of another indica- 
tor at a future time t%, i'^)? This question is of interest 
for example in trying to decide between different can- 
didates for a faculty position. A possible consideration 
might be: how likely is each candidate to become a mem- 
ber of the National Academy of Sciences 20 years down 
the line? For that purpose, one would like to rank the 
candidates by their expected cumulative achievement af- 
ter 20 years. Which means in particular that citations 
obtained after time t\ to papers written before time t\ 
are relevant, (b) Instead, to award grant money or other 
resources for future research at time t\ one would like 
to rank the candidates by their expected scientific out- 
put from time t\ on to some future time. In deciding 
who should get a grant it should be irrelevant how many 
more citations the earlier papers of that individual are 
expected to collect in future years. 



II. PROCEDURE 

We use the ISI Web of Science database in the 'gen- 
eral search' mode. 2 ISI has recently incorporated tools 
under 'Author Finder' that help to discriminate between 
different researchers with the same name. Once the pub- 
lications of a researcher are identified, ISI provides in the 
'Citation Report' the total number of citations N c , total 



1 Philip Ball, "Is it any good? Measuring scientific 
merit", presentation at Deutsche Physikalische Gesellschaft 
meeting, Regensburg, Germany, March 26-30, 2007, 
http: / / www.agrfoto.com/philipball/docs /pdf/anygood.pdf 



2 In using the very valuable ISI resource for individual evaluations, 
one should keep in mind that it has limitations, e.g.: (i) it will 
of course miss citations where the author's name is misspelled; 
(ii) books, book chapters and most conference proceedings are 
not included; (iii) citations to "Rapid Communications" papers 
in Physical Review that include (R) in the citation are currently 
not counted by ISI. 
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TABLE I: Averages and standard deviations in different 
time frames for the four indicators considered for the sam- 
ple PRB80. The last column gives a = N c /h 2 . 
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FIG. 1: h-index versus time (left scale) and total number of 
citations (N c ) versus time (right scale) for E. Witten (the- 
orist) and M. Cardona (experimentalist). The dashed and 
dash-dotted lines show linear and quadratic fits to h-index 
and N c respectively. 



number of papers N p , citations per paper n c — N c /N p 
and the h-index, all at the present time. 

ISI also allows one to restrict the time frame of the 
papers' publication date, so it is easy to find N p (ti), the 
number of papers published up to year ti, or N p (t\,t2), 
the number of paper published between times t\ and t^. 
However, ISI does not provide a similarly simple way to 
obtain these values for the other indicators under consid- 
eration. To obtain this information, one needs to export 
('save') the information in the Citation Report into an 
Excel file and add up the citations in the time interval 
desired. It is a straightforward but tedious procedure. 

For illustration we show in Fig. 1 the h-index and the 
number of citations as function of time for two promi- 
nent physicists, a theorist and an experimentalist. As we 
conjectured in Rcf. 1, the h-index follows approximately 
a linear behavior with time and the total number of ci- 
tations is approximately quadratic with time. Similar 
behavior is found in many other cases. 



III. SAMPLE PRB80 

Because citation patterns vary between fields and also 
between subfields and there are also trends with time, 
we chose to look at authors in a single subfield and of 
comparable scientific age to compare the predictive power 
of the various indicators. Ideally we would like to pick a 
random subset of all physicists that earned their Ph.D. 



Time frame 


h, a h 


N c , a Nc 




n c , On c 


a, a a 


yr 1-12 


14, 6 


740, 720 


45, 26 


16, 10 


3.5, 1.2 


yr 1-24 


24, 12 


2452, 2471 


90, 51 


25, 14 


4.1, 1.7 


yr 13-24 


12, 8 


759, 831 


45, 33 


13, 10 


3.8, 1.7 



in a given subfield in a given year and published in that 
subfield throughout their career. However we have no 
practical way to make such a selection. As alternative, we 
picked a sample of 50 physicists that started publishing 
around 1980 by the following procedure: 

(i) We considered papers published in Physical Review 
B in 1985 that have today citations in the range 45 to 
60 (an arbitrary choice, simply to avoid extremes). In 
practice we started with papers with 60 citations and 
went as far down as needed to get the number of authors 
desired for our sample. 

(ii) From the authors of those papers, we selected those 
that had published their first paper between 1978 and 
1982. 

Because of the journal used for the selection 
(Phys.Rev.B) the sample contained mostly physicists 
that published in the field of condensed matter physics 
throughout their career. There was however a small sub- 
set of the sample that subsequently switched to other 
subfields. 

We then looked at the publication records of these au- 
thors during the first 12 years of their career (starting 
with their first published paper) and in the subsequent 
12 years. In table I, we show the average and standard 
deviation values of the four indicators considered in the 
first 12 years, first 24 years, and years 13 to 24, all mea- 
sured from the publication year of the first paper. It can 
be seen that h, N p and n c increase by approximately a 
factor of 2 in comparing 12 year and 24 year periods and 
N c by approximately a factor of 4, as expected. The last 
column in the table shows the average and standard de- 
viation of a = N c /h 2 for this sample. In Ref. 1 a was 
observed to be typically in the range 3 to 5. 

As discussed earlier we would like to know: 

(a) How well does the performance during the first 12 
years predict the cumulative achievement over the entire 
24 year period? 

(b) How well does the performance during the first 12 
years predict the performance in the subsequent 12 years? 

Because the number of citations is expected to grow 
quadratically with time, we used \TNc~ as measure of total 
citations. 

First we consider the predictive power of the various 
indicators after the first 12 years (ti) for the cumulative 
achievement in the 24-year period fa)- In Fig. 2 we 
show the total number of citations after 24 years ver- 
sus each indicator after 12 years for each member of the 
sample, and their correlation coefficient r (=covariance / 
product of standard deviations). It can be seen that the 
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FIG. 2: Scatter plot of total number of citations N c after 
<2 =24 years versus the value of the various indicators at 
ii=12 years (t measured from the date of the first publica- 
tion). N c = number of citations, h=h- index, N p =number of 
papers, n c =mean number of citations per paper, r is the cor- 
relation coefficient, r=l corresponding to perfect linear corre- 
lation. 



h-index and the number of citations N c at time t\ are the 
best predictors of cumulative citations at the future time 
t2, with correlation coefficient r — 0.89. The number 
of papers correlates somewhat less (r = 0.74), and the 
number of citations per paper n c has lowest correlation 
with cumulative citations, with r = 0.54. 

According to these results, if one wishes to select 
among various candidates at time t\ the one(s) that will 
have the largest number of citations at the later time t2, 
the h-index or the number of citations at time t\ are good 
selection criteria. A candidate with low h or low N c at 
time t\ will not have a high N c at time t%. Instead, a 
candidate with low N p or low n c at time t\ has a much 
higher chance of ending up with high N c at time ti- 

Figure 3 shows the ability of each indicator to predict 
its own cumulative value. Here, the differences between 
indicators is smaller and the correlation coefficient is high 
in all cases. Still, the h-index shows the largest predictive 
power, with r = 0.91. I.e., a researcher with high h-index 
after 12 years is highly likely to have a high h-index after 
24 years. 

It is more difficult for the indicators at time t\ to pre- 
dict scientific achievement occurring only in the subse- 
quent period, i.e. without taking into account the cita- 
tions after time t\ to work performed prior to t\. As 
discussed, one would like to make such predictions to de- 
cide on allocation of research resources. In figure 4, the 
ability of the indicators at time t\ to predict citations to 
papers written in the t\ — ti time interval is considered. 
The highest correlation coefficient occurs for the h-index 
(r = 0.60), and the lowest for mean number of citations 
per paper (r = 0.21). Similarly, as shown in Fig. 5, the 



FIG. 3: Predictive power of each indicator at time ti=12 
years for the value of the same indicator at time t2—24 years 
for sample PRB80. 

ability of each index to predict itself is largest for the h- 
index (r = 0.61) and lowest for number of citations per 
paper (r = 0.23). 

So if we choose to measure 'scientific achievement' by 
either total citation count N c or by the h-index, these re- 
sults imply that (at least in this example) the h-index has 
the highest ability to predict future scientific achieve- 
ment. In fact, even choosing the number of papers N p as 
the measure of achievement the h-index yields the high- 
est predictive power, as shown in Fig. 6: r = 0.49, versus 
r = 0.43, r = 0.42 and r = 0.092 for N p , N c and n c as 
predictors respectively. In allocating research resources 
(eg grant funding) to otherwise comparable researchers, 
if the goal is to maximize the expected return on the 
investment as measured by either A^ c or h-index or N p , 
we suggest that these results should be considered. If 
one chose instead to use as indicator of scientific achieve- 
ment the mean number of citations per paper (following 
Lehmann et al@, our results suggest that (as in the 
stock market) 'past performance is not predictive of fu- 
ture performance'. 

IV. SAMPLE APS95 

As a second example, we consider the set of 
physicists elected to fellowship in 1995 by the Di- 
vision of Condensed Matter Physics of the Ameri- 
can Physical Society. (The list is available online at 
|http://dcmp.bc.edu/page.php?name=fellows_95[ ) . From 
the list of 29 individuals, two were excluded because it 
was difficult to identify their publications due to name 
overlaps. We evaluated the indicators for this group up 
to the year 1994 (right before being elected to fellowship), 
up to 2006, and in the 12 years from 1995 to 2006. The 
averages and standard deviations are shown in Table II. 
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FIG. 4: Predictive power of each indicator at time t\ = Y2 years 
for the number of citations to papers published in the ti — ti 
time interval, with ^=24 years, for sample PRB80. 



FIG. 6: Predictive power of each indicator at time t\=Yl years 
for the number papers published in the ti — ti time interval, 
with £2=24 years, for sample PRB80. 
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FIG. 5: Predictive power of each indicator at time t\ = Y2 years 
for the value of the same indicator for the papers published in 
the ti — ti time interval, with i2=24 years, for sample PRB80. 



Figure 7 shows the number of citations in the 12 years 
after being elected to fellowship versus each of the in- 
dicators up to the year 1994. The correlations here are 
weaker than in the first example, nevertheless the h-index 
shows a stronger correlation (r=0.49) that all other in- 
dicators. Similarly, Figure 8 shows that the h-index is a 
better predictor of itself (r=0.54) than any of the other 
indicators. 

Incidentally, note the large dispersion in the values of 
the indicators at time t\ (e.g. h ranging from 9 to 43, N c 
from 482 to 7471, N p from 19 to 248), which indicates 
that the APS fellowship committee does not rely (for 
better or for worse) on any of these numerical indicators 
as a deciding factor for election to fellowship. 



TABLE II: Averages and standard deviations in different 
time frames for the four indicators considered for the sam- 
ple APS95. The last column gives a = N c /h 2 . 



Time frame 


h, o h 


N c , a Nc 


N p , crjVp 


n c , (Tn c 


a, a a 


up to yr 94 


22, 7 


1887, 1348 


114, 53 


17, 7 


3.6, 0.9 


up to yr 06 


36, 13 


5741, 4468 


205, 122 


32, 41 


4.2, 1.6 


yrs 95-06 


19, 11 


1718, 1760 


91, 87 


19, 12 


4.1, 2.1 



The data for cumulative achievement up to 2006 are 
shown in Figs. 9 and 10. It can be seen that the pattern 
is similar to Figs. 2 and 3, the corresponding graphs for 
sample PRB80. 

It is easy to understand why the correlations here are 
weaker than in the first example . Scientists are elected to 
APS fellowship at very different stages in their career, so 
the horizontal axis variables in these figures are not time- 
normalized. For example, a member of this group may 
have had a large N c in 1994 because he/she had been 
publishing for many years at a slow rate, and his/her 
productivity in the subsequent 12 years would not be 
expected to be larger than that of another scientist of 
this group that started his/her career much later and 
had a higher publication rate. 

Note also that the 12-year productivity and impact 
of the APS fellows sample (table II) is on average sub- 
stantially higher than that of the random sample PRB80 
(table I), and that there arc no points on the x-axis in 
the figures for the period t\ —t 2 for the APS sample (figs. 
7,8) in contrast to that of the PRB80 sample (figs. 4-6). 
These differences are to be expected from the fact that 
election to APS fellowship is not a random process. 



20 




, I ,-, , , I , , , , I , , , , I , 



10 20 30 40 
hit.) 



i I .*. . . I . . . . I . . . . I . 



20 40 60 80 
N^t,)" 2 



50 — 
: 



r 



52," 



■ I .... I ■ ... I .... I ■ 



10 20 30 40 
h(t,) 



50 




I . i . . I i . . i I i i . i I i 



20 40 60 80 
N c ( tl )" 2 



80 
L 60 

CM 

±f 40 
20 




■ «l 1 




- (c) X 




— r-0.40 \ 


K 


— H 

:» « ,V*» 






H 







50 100 150 200 250 
N,ttj) 




150 



T"T [ ijT"T'"+4 



» X «" 
X 



Jmm I mm I mm I mm I 



150 



50 100 150 200 250 
Npttj) 



idi''"" 
r-0.55 » 

5 

«" x 

x » 

X x x „ 

#*„"" 

I I I I I I I I I 



10 20 30 

n c (t,J 



FIG. 7: Predictive power of each indicator at year 1994 for 
the number of citations to papers published in the 1995 — 2006 
time interval for the sample APS95. 



FIG. 9: Predictive power of each indicator at year 1994 for 
the number of citations to all papers published up to 2006 for 
the sample APS95. 
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FIG. 8: Predictive power of each indicator at year 1994 for 
the value of the same indicator for the papers published in 
the 1995 - 2006 time interval for the sample APS95. 



FIG. 10: Predictive power of each indicator at year 1994 for 
the value of the same indicator at year 2006 for the sample 
APS95. 



V. COMBINING h AND iV c 

Our results indicate that the h-index and the total 
number of citations are better than the number of pa- 
pers and the mean citations per paper to predict future 
achievement, with achievement defined by either the in- 
dicator itself or the total citation count N c . Further- 
more, we found a small consistent advantage of the h- 
index compared to N c . 

It has been argued in the literature that one drawback 
of the h-index is that it does not give enough 'credit' to 
very highly cited papers, and various modifications have 
been proposed to correct this, in particular Egghe's g- 
indexpl, Jin's AR indexQ and Komulski's H^> indexp. 



These modified indices reward authors with higher ci- 
tation numbers in the papers that contribute to the h- 
count. 

To test the possibility that giving a higher weight to 
highly cited papers may enhance the predictive power of 
the h-index, we considered the following expression: 

h a = Vh 2 + aN c (1) 

and asked the question: which value of a will result in 
h a {t\) best predicting the citation count of future work, 
N c (ti,t 2 )^ That is, we considered the cases of Fig. 4(a) 
and Fig. 7(a) with h a {t\) in the abscissa instead of h{t\). 

The resulting correlation coefficients as function of a 
are shown in Fig. 11. Surprisingly, a small negative 
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FIG. 11: Correlation coefficient r between N c (ti,t2) and h a 
defined in Eq. (2) for samples PRB80 and APS95. As a 
increases, the curves approach the asymptotic values given 
by the dashed lines, r = 0.53 and r = 0.43 respectively 

a (a ~ —0.1) yields the largest correlation coefficient 
in both samples considered. For positive a, the corre- 
lation coefficients decrease monotonically and approach 
the values corresponding to the predictor N c , r — 0.53 
and r = 0.43 respectively, corresponding to Figs. 4(b) 
and 7(b). 

Consequently the best predictor of future achievement 
(achievement defined as number of citations) inferred 
from our data (e.g. sample PRB80) would be a linear 
regression fit to y/ N c (ti,t2) versus h a (ti) with a = —0.1 
(correlation coefficient r = 0.62), leading to the para- 
doxical result that given two researchers with the same 
h-index, the one with lower N c (ti) should be expected to 
earn higher number of citations in the subsequent time 
period. 

By using the relation N c = ah 2 we can rewrite Eq. (1) 

as 

h a = hyl + aa (2) 

The fact that a negative a yields larger predictive power 
indicates that authors with large values of a — N c /h 2 
are on average less likely to earn a larger number of ci- 
tations in future work than authors with smaller a. We 
believe that this effect is principally due to the effect of 
coauthorship discussed in the next section. 

VI. DISCUSSION 



predictor of productivity (N p ) than N p itself. Further- 
more, in attempting to combine h with N c to enhance 
the predictive power of h we found that iV c should enter 
with a negative weight. 

It is interesting and not obvious that the h-index is able 
to predict both itself and the productivity N p better than 
N p can predict itself. Perhaps it indicates that some of 
the prolific authors with small citation counts feel less of 
an incentive to continue being prolific as they perceive 
that their work is not having impact. 

We believe the superiority of h compared to N c as a 
predictor is due to the issue of coauthorship, touched on 
in ref. [l[ and in the introduction. Let us elaborate on 
this further. 

Consider a paper j with N£ citations coauthored by 
several scientists with different levels of seniority as well 
as of ability, each of which made different contributions 
to this paper. If we are counting citations, each coauthor 
gets the same 'credit', i.e. adds Njl citations to his/her 
total citation count, independent of his/her individual 
contribution to this paper. 

Instead, if we are considering h, this paper will or will 
not contribute to the i-th author's h-index, hi, depending 
on whether N% > hi or N[ < hi. If it contributes, that 
author only 'needs' hi of the N-! citations to increase 
his/her h by one. So one may say that each author i gets 
'allocated' only hi of the Nj! citations. Junior, as well 
as less able authors, are likely to have a lower h than 
senior and more able authors, and they are likely to have 
made a lesser contribution to the paper 3 . Hence it is 
appropriate that they benefit from a smaller portion of 
the total Nl . 

In other words, to 'first order', using h rather than 
N c as a measure of scientific achievement automatically 
reduces an important source of distortion when multiply 
coauthored papers are involved, by allocating a smaller 
portion of the credit to those authors that are likely to 
have contributed less. The argument is not foolproof 
and exceptions undoubtedly will occur, but the 'injustice' 
done to powerful junior coauthors in the early stages of 
their career will automatically be remedied in due time 
as their h-index rapidly increases. 

Furthermore it is interesting and revealing that the ad- 
vantage of h over N c in predicting future N c values is lost 
when cumulative rather than only citations to new pa- 
pers are considered (Fig. 2 vs. Fig. 4, Fig. 9 vs. Fig. 
7). We suggest that this also reflects the effect of coau- 
thorship. Highly cited papers in the initial period will 
usually continue to garner a high number of citations in 
the subsequent period also for those among the coau- 
thors that made only minor contributions to the paper. 



In summary, we found that the h-index appears to be 
better able to predict future achievement than the other 
three indicators, number of citations, number of papers 
and mean citations per paper, with achievement defined 
by either the indicator itself or the total citation count 
N c . In addition, the h-index was found to be a better 



3 Of course it will often be the case that a junior coauthor will have 
performed most of the actual work for the paper. Nevertheless, if 
the paper has senior coauthors and ended up with a large number 
of citations, it will often be the case that the senior coauthor/s 
will have played the crucial role. 
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While the cumulative citations of those individuals will 
be high, they should be less likely to make major new 
contributions in the subsequent period. Thus we argue 
that even for a decision focused on optimizing expected 
cumulative achievement h should be favored as an in- 
dicator as it appears better able to predict individual 
cumulative achievement. 

Other recently proposed bibliometric measures that 
give more weight to very highly cited papers such as 
Egghe's g-indexjj], Jin's AR indexQ and Komulski's H 2 
indexQ are likely to suffer from the same drawback as 
N c , since they will assign the citations of highly cited 
papers equally to all coauthors without discrimination. 
Thus we conjecture that the predictive power of these 
modified indices is likely to be worse than that of the 
h-index, as our analysis of the h a index (Eq. 2) also 
suggests. 

With respect to the indicator n c , mean number of cita- 
tions per paper, our results indicate that it has very little 
predictive value. The low correlation found between n' c s 
in the different timeframes (initial 12 years and subse- 
quent 12 years) is due to a variety of reasons. In some 
cases the individual's productivity N p remained similar 
but the total impact N c changed substantially, some- 
times both productivity declined and the total impact de- 
clined even more, sometimes productivity increased and 
the mean impact per paper also increased. 

These results are at odds with the conclusions of the 
recent study by Lehmann, Jackson and Lautrup 0, 
They start from the reasonable assumption that "the 
quality of a scientist is a function of his or her full citation 
record" and address the question of which single-number 
indicator is best to discriminate between scientists, aim- 
ing "to assign some measure of quality". They argue 
that an indicator is of no practical use unless "the un- 
certainty in assigning it to individual scientists is small" . 
They perform a Bayesian analysis of citation data from 
a large sample extracted from the SPIRES database [7J, 
and find that among the three indicators (i) mean num- 
ber of citations per paper (n c ), (ii) number of papers 
published per year (N p /n), and (iii) ft,-index, n c is far 
superior in discriminating between scientists. They con- 
clude that [cjompared with the h-index, the mean number 
of citations per paper is a superior indicator of scien- 
tific quality, in terms of both accuracy and precision, and 
hence that "the mean or median citation counts (per pa- 
per) can be a useful factor in the academic appointment 
process" . 

Bornmann and Daniel [8| echo their conclusions and 
state that the Lehmann et al study "raises some doubt 
as to the accuracy of the (h-)index for measuring scien- 
tific performance", that "the mean, median, and max- 
imum numbers of citations are reliable and permit ac- 
curate measures of scientific performance" , and instead 



that " the h index is shown to lack the necessary accuracy 
and precision to be useful" . 

We argue that these conclusions are deeply flawed. Our 
results here have shown that the h-index is a far better 
predictor of future scientific achievement than the mean 
number of citations per paper, and surely the same would 
hold for the median. For example, the correlation coeffi- 
cient between the number of citations in the subsequent 
12 years and the h-index in the previous 12 years in sam- 
ple PRB80 was found to be r = 0.60, much larger than 
r = 0.21, the correlation found with the mean number 
of citations per paper in the previous 12 years. The h- 
index was also far superior at predicting itself, r = 0.61 
versus r = 0.23 for n c . Similar pattern was found in our 
other sample, and it is likely that similar results would 
be obtained with the sample used by Lehmann et al. 

This example illustrates the danger in using sophisti- 
cated mathematical analysis to jump to practical con- 
clusions (of sometimes life-changing consequences) in the 
delicate issues under considerations. While the Lehmann 
et al study may be correct in concluding that the mean 
number of citations is better to 'discriminate' between 
scientists for a given fixed time period according to their 
definition, the fallacy in their argument appears to be 
that this does not imply that the indicator is associated 
with an identifiable individual trait that would be ex- 
pected to persist with time. And certainly not with 'sci- 
entific quality'. Else, one is forced to conclude in light 
of the results of the present paper that 'scientific quality' 
(as defined by Lehmann et al) in the past is nearly uncor- 
rected with scientific quality in the future for individual 
scientists, a conclusion that defies common sense. 

Instead, a variety of studies [l| . [9j . [To| . (TT| . have shown 
that the h-index by and large agrees with other objective 
and subjective measures of scientific quality in a vari- 
ety of different disciplines [H, [H, Q M, M, 03, and 
the present study shows that the h-index is also effective 
in discriminating among scientists that will perform well 
and less well in the future. We conclude tentatively (as- 
suming that future empirical studies will corroborate the 
results of this paper) that the h-index is a useful indicator 
of scientific quality that can be profitably used (together 
with other criteria) to assist in academic appointment 
processes and to allocate research resources. 
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